SEQUENCE LISTING 



1) GENERAL INFORMATION: 

(i) APPLICANT: Wilson, Peter J 

Morris, Charles P 
Anson, Donald S 
Occhiodoro, Teresa 
Bielicki , Julie 
Clements, Peter R 
Hopwood, John J 

(ii) TITLE OF INVENTION: GL YCOS YLAT I ON VARIANTS OF 



(iii) NUMBER OF SEQUENCES: 15 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Scully, Scott, Murphy & Presser 

(B) STREET: .400 Garden City Plaza 

(C) CITY: Garden City 

(D) STATE: New York 

( E ) COUNTRY : USA 

(F) ZIP: 11530 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) . COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1. 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 991,973 

(B) FILING DATE: 17-DEC-1992 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: DiGiglio, Frank S 

(B) REGISTRATION NUMBER: 31,34 6 

(C) REFERENCE/DOCKET NUMBER: 8416Z 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 516-742-4343 

(B) TELEFAX: 516-742-4366 

(C) TELEX: 230 901 SANS UR 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 125.. 1774 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CGGCTGTGTT GCGCAGTCTT CATGGGTTCC CGACGAGGAG GTCTCTGTGG CTGCGGCGGC 60 

TGCTAACTGC GCCACCTGCT GCAGCCTGTC CCCGCCGCTC TGAAGCGGCC GCGTCGAAGC 120 

@GAA ATG CCG CCA CCC CGG ACC GGC CGA GGC CTT CTC TGG CTG GGT CTG 16 9 

«f Met Pro Pro Pro Arg Thr Gly Arg Gly Leu Leu Trp Leu Gly Leu 
h t 1 5 10 15 

GJTT CTG AGC TCC GTC TGC GTC GCC CTC GGA TCC GAA. ACG CAG GCC AAC 217 
l&l Leu Ser Ser Val Cys Val Ala Leu Gly Ser Glu Thr Gin Ala Asn 
H 20 25 : 30 

TCG ACC ACA GAT GCT CTG AAC GTT CTT CTC ATC ATC GTG GAT GAC CTG 265 
.Ser Thr Thr Asp Ala Leu Asn Val Leu Leu lie lie Val Asp Asp Leu 
ru 35 40 45 

P[PC CCC TCC CTG GGC TGT TAT GGG GAT AAG CTG GTG AGG TCC CCA AAT 313 
Ajg Pro Ser Leu Gly Cys Tyr Gly Asp Lys Leu Val Arg Ser Pro Asn 
k >. 50 55 60 

ATT GAC CAA CTG GCA TCC C AC AGC CTC CTC TTC CAG AAT GCC TTT GCG 361 
He Asp Gin Leu Ala Ser His Ser Leu Leu Phe Gin Asn Ala Phe Ala 
65 70 75 

CAG CAA GCA GTG TGC GCC CCG AGC CGC GTT TCT TTC CTC ACT GGC AGG 409 
Gin Gin Ala Val Cys Ala Pro Ser Arg Val Ser Phe Leu Thr Gly Arq 
80 85 90 95 

AGA CCT GAC ACC ACC CGC CTG TAC GAC TTC AAC TCC TAC TGG AGG GTG 457 
Arg Pro Asp Thr Thr Arg Leu Tyr Asp Phe Asn Ser Tyr Trp Arg Val 

100 105 110 

CAC GCT GGA AAC TTC TCC ACC ATC CCC CAG TAC TTC AAG GAG AAT GGC 505 
His Ala Gly Asn Phe Ser Thr He Pro Gin Tyr Phe Lys Glu Asn Gly 
115 120 125 
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TAT GTG ACC ATG TCG GTG GGA AAA GTC TTT CAC CCT. GGG ATA TCT TCT 553 
Tyr Val Thr Met Ser Val Gly Lys Val Phe His Pro Gly He Ser Ser 
130 i35 140 

AAC CAT ACC GAT GAT TCT CCG TAT AGC TGG TCT TTT CCA CCT TAT CAT 601 
Asn His Thr Asp Asp Ser Pro Tyr Ser Trp Ser Phe Pro Pro Tyr His 
145 150 155 

CCT TCC TCT GAG AAG TAT GAA AAC ACT AAG ACA TGT CGA GGG CCA GAT 649 
Pro Ser Ser Glu Lys Tyr Glu Asn Thr Lys Thr Cys Arg Gly Pro Asp 
160 165 170 175 

GGA GAA CTC CAT GCC AAC CTG CTT TGC CCT GTG GAT GTG CTG GAT GTT 69 7 

Gly Glu Leu His Ala Asn Leu Leu Cys Pro Val Asp Val Leu Asp Val 

180 185 190 

CCC GAG GGC ACC TTG CCT GAC AAA CAG AGC ACT GAG CAA GCC ATA CAG 745 
Pro Glu Gly Thr Leu Pro Asp Lys Gin Ser Thr Glu Gin Ala He Gin 
195 200 205 

TTG TTG GAA AAG ATG AAA ACG TCA GCC AGT CCT TTC TTC CTG GCC GTT 793 

Leu Leu Glu Lys Met Lys Thr Ser Ala Ser Pro Phe Phe Leu Ala Val 

3 210 215 220 

d|G TAT CAT AAG CCA CAC ATC CCC TTC AGA TAC CCC AAG GAA TTT CAG 841 

G|y Tyr His Lys Pro His He Pro Phe Arg Tyr Pro Lys Glu Phe Gin 
S 225 230 235 

ASG TTG TAT CCC TTG GAG AAC ATC ACC CTG GCC CCC GAT CCC GAG GTC 889 

Lys Leu Tyr Pro Leu Glu Asn lie Thr Leu Ala Pro Asp Pro Glu Val 
2;4 0 245 250 255 

CCT GAT GGC CTA CCC CCT GTG GCC TAC AAC CCC TGG ATG GAC ATC AGG 937 

Eeo Asp Gly Leu Pro Pro Val Ala Tyr Asn Pro Trp Met Asp He Arg 
rU 260 265 270 

QAA CGG GAA GAC GTC CAA GCC TTA AAC ATC AGT GTG CCG TAT GGT CCA 985 

Gin Arg Glu Asp Val Gin Ala Leu Asn He Ser Val Pro Tyr Gly Pro 

275 280 285 

ATT CCT GTG GAC TTT CAG CGG AAA ATC CGC CAG AGC TAC TTT GCC TCT 1033 
He Pro Val Asp Phe Gin Arg Lys lie Arg Gin Ser Tyr Phe Ala Ser 
290 295 300 

GTG TCA TAT TTG GAT ACA CAG GTC GGC CGC CTC TTG AGT GCT TTG GAC 1081 
Val Ser Tyr Leu Asp Thr Gin Val Gly Arg Leu Leu Ser Ala Leu Asp 
305 310 315 

GAT CTT CAG CTG GCC AAC AGC ACC ATC ATT GCA TTT ACC TCG GAT CAT 1129 
Asp Leu Gin Leu Ala Asn Ser Thr He He Ala Phe Thr Ser Asp His 
320 325 330 335 
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GGG TGG GCT CTA GGT GAA CAT GGA GAA TGG GCC AAA TAC AGC AAT TTT 1177 
Gly Trp Ala Leu Gly Glu His Gly Glu Trp Ala Lys Tyr Ser Asn Phe 

340 345 350 

GAT GTT GCT ACC CAT GTT CCC CTG ATA TTC TAT GTT CCT GGA AGG ACG 1225 
Asp Val Ala Thr His Val Pro Leu He Phe Tyr Val Pro Gly Arg Thr 
355 360 365 

GCT TCA CTT CCG GAG GCA GGC GAG AAG CTT TTC CCT TAC CTC GAC CCT 127 3 

Ala Ser Leu Pro Glu Ala Gly Glu Lys Leu Phe Pro Tyr Leu Asp Pro 
370 375 380 

TTT GAT TCC GCC TCA CAG TTG ATG GAG CCA GGC AGG CAA TCC ATG GAC 1321 
Phe Asp Ser Ala Ser Gin Leu Met Glu Pro Gly Arg Gin Ser Met Asp 
385 390 395 

CTT GTG GAA CTT GTG TCT CTT TTT CCC ACG CTG GCT GGA CTT GCA GGA 1369 
Leu Val Glu Leu Val Ser Leu Phe Pro Thr Leu Ala Gly Leu Ala Gly 
400 405 410 415 

CTG CAG GTT CCA CCT CGC TGC CCC GTT CCT TCA TTT CAC GTT GAG CTG 1417 
Leu Gin Val Pro Pro Arg Cys Pro Val Pro Ser Phe His Val Glu Leu 

420 425 430 

*JC AGA GAA GGC AAG AAC CTT CTG AAG CAT TTT CGA TTC CGT GAC TTG 1465 
Cys Arg Glu Gly Lys Ash Leu Leu Lys His Phe Arg Phe Arg Asp Leu 
% t : 4 35 440 445 



GAG GAT CCG TAC CTC CCT GGT AAT CCC CGT GAA CTG ATT GCC TAT 1513 
Glu Glu Asp Pro Tyr Leu Pro Gly Asn Pro Arg Glu Leu lie Ala Tyr 
* m 450 455 460 

#C CAG TAT CCC CGG CCT TCA GAC ATC CCT CAG TGG AAT TCT GAC AAG 1561 
Ser Gin Tyr Pro Arg Pro Ser Asp He Pro Gin Trp Asn Ser Asp Lys 
fU 465 470 475 

CjjG AGT TTA AAA GAT ATA AAG ATC ATG GGC TAT TCC ATA CGC ACC ATA 1609 
Pro Ser Leu Lys Asp He Lys lie Met Gly Tyr Ser lie Arg Thr He 
480 485 490 495 

GAC TAT AGG TAT ACT GTG TGG GTT GGC TTC AAT CCT GAT GAA TTT CTA 1657 
Asp Tyr Arg Tyr Thr Val Trp Val Gly Phe Asn Pro Asp Glu Phe Leu 

500 505 510 

GCT AAC TTT TCT GAC ATC CAT GCA GGG GAA CTG TAT TTT GTG GAT TCT 1705 
Ala Asn Phe Ser Asp He His Ala Gly Glu Leu Tyr Phe Val Asp Ser 
515 520 525 

GAC CCA TTG CAG GAT CAC AAT ATG TAT AAT GAT TCC CAA GGT GGA GAT 1753 
Asp Pro Leu Gin Asp His Asn Met Tyr Asn Asp Ser Gin Gly Gly Asp 
530 535 540 
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CTT TTC CAG TTG TTG ATG CCT TGAGTTTTGC CAACCATGGA TGGCAAATGT 1804 
Leu Phe Gin Leu Leu Met Pro 
545 550 

GATGTGCTCC CTTCCAGCTG GTGAGAGGAG GAGTTAGAGC TGGTCGTTTT GTGATTACCC 1864 

ATAATATTGG AAGCAGCCTG AGGGCTAGTT AATCCAAAGA TGCATCAACA ATTTGGCCTG 1924 

AGAATATGTA ACAGCCAAAC CTTTTCGTTT AGTCTTTATT AAAATTTATA ATTGGTAATT 1984 

GGACCAGTTT TTTTTTTAAT TTCCCTCTTT TTAAAACAGT TACGGCTTAT TTACTGAATA 2044 

AATACAAAGC AAACAAACTC AAGTTATGTC ATACCTTTGG AT AC GA AG AC CATACATAAT 2104 

AACCAAACAT AACATTATAC ACAAAGAATA CTTTCATTAT TTGTGGAATT TAGTGCATTT 2164 

CAAAAAGTAA TCATATATCA AACTAGGCAC CACACTAAGT TCCTGATTAT TTTGTTTATA 2224 

ATTTAATAAT ATATCTTATG AGCCCTATAT ATTCAAAATA TTATGTTAAC ATGTAATCCA 2284 

TGTTTC TTTT TCC 2297 

(1) INFORMATION FOR SEQ ID NO: 2: 

r y (i) SEQUENCE CHARACTERISTICS: 

L :3 (A) LENGTH: 550 amino acids 

(B) TYPE:. amino acid 
H (D) TOPOLOGY: linear 

=_ (ii) MOLECULE TYPE: protein 

fy (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Pro Pro Arg Thr Gly Arg Gly Leu Leu Trp Leu Gly Leu Val 
1 5 10 15 

Leu Ser Ser Val Cys Val Ala Leu Gly Ser Glu Thr Gin Ala Asn Ser 
20 25 30 

Thr Thr Asp Ala Leu Asn Val Leu Leu lie lie Val Asp Asp Leu Arg 
35 40 45 

Pro Ser Leu Gly Cys Tyr Gly Asp Lys Leu Val Arg Ser Pro Asn lie 
50 55 60 

Asp Gin Leu Ala Ser His Ser Leu Leu Phe Gin Asn Ala Phe Ala Gin 
65 70 75 80 

Gin Ala Val Cys Ala Pro Ser Arg Val Ser Phe Leu Thr Gly Arg Arg 

85 90 95 

Pro Asp Thr Thr Arg Leu Tyr Asp Phe Asn Ser Tyr Trp Arg Val His 
100 105 110 
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Ala Gly Asn Phe Ser Thr lie Pro Gin Tyr Phe Lys Glu Asn Gly Tyr 
115 120 125 

Val Thr Met Ser Val Gly Lys Val Phe His Pro Gly He Ser Ser Asn 
130 135 140 

His Thr Asp Asp Ser Pro Tyr Ser Trp Ser Phe Pro Pro Tyr His Pro 
145 150 155 160 

Ser Ser Glu Lys Tyr Glu Asn Thr Lys Thr Cys Arg Gly Pro Asp Gly 

165 170 175 

Glu Leu His Ala Asn Leu Leu Cys Pro Val Asp Val Leu Asp Val Pro 
180 185 190 

Glu Gly Thr Leu Pro Asp Lys Gin Ser Thr Glu Gin Ala He Gin Leu 
195 200 205 

Leu Glu Lys Met Lys Thr Ser Ala Ser Pro Phe Phe Leu Ala Val Gly 
210 215 220 

Tyr His Lys Pro His He Pro Phe Arg Tyr Pro Lys Glu Phe Gin Lys 
225 230 235 240 

Tyr Pro Leu Glu Asn He Thr Leu Ala Pro Asp Pro Glu Val Pro 
-r 245 250 255 

Ajp Gly Leu Pro Pro Val Ala Tyr Asn Pro Trp Met Asp He Arg Gin 
260 265 270 

Arg Glu Asp Val Gin Ala Leu Asn He Ser Val Pro Tyr Gly Pro He 
□ 275 280 285 

PJo Val Asp Phe Gin Arg Lys He Arg Gin Ser Tyr Phe Ala Ser Val 
?y 290 295 300 

S|r Tyr Leu Asp Thr Gin Val Gly Arg Leu Leu Ser Ala Leu Asp Asp 
3@ 5 310 315 320 

Leu Gin Leu Ala Asn Ser Thr He He Ala Phe Thr Ser Asp His Gly 

325 330 335 

Trp Ala Leu Gly Glu His Gly Glu Trp Ala Lys Tyr Ser Asn Phe Asp 
340 345 350 

Val Ala Thr His Val Pro Leu He Phe Tyr Val Pro Gly Arg Thr Ala 
355 360 ' 365 

Ser Leu Pro Glu Ala Gly Glu Lys Leu Phe Pro Tyr Leu Asp Pro Phe 
370 375 380 

Asp Ser Ala Ser Gin Leu Met Glu Pro Gly Arg Gin Ser Met Asp Leu 
385 390 395 400 
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Val Glu Leu Val Ser Leu Phe Pro Thr Leu Ala Gly Leu Ala Gly Leu 

405 410 415 

Gin Val Pro Pro Arg Cys Pro Val Pro Ser Phe His Val Glu Leu Cys 
420 425 430 

Arg Glu Gly Lys Asn Leu Leu Lys His Phe Arg Phe Arg Asp Leu Glu 
435 440 445 

Glu Asp Pro Tyr Leu Pro Gly Asn Pro Arg Glu Leu lie Ala Tyr Ser 
450 455 460 

Gin Tyr Pro Arg Pro Ser Asp lie Pro Gin Trp Asn Ser Asp Lys Pro 
465 470 475 480 

Ser Leu Lys Asp lie Lys lie Met Gly Tyr Ser lie Arg Thr lie Asp 

485 490 495 

Tyr Arg Tyr Thr Val Trp Val Gly Phe Asn Pro Asp Glu Phe Leu Ala 
500 505 510 

A|n Phe Ser Asp lie His Ala Gly Glu Leu Tyr Phe Val Asp Ser Asp 
3 515 520 525 

Pjto Leu Gin Asp His Asn Met Tyr Asn Asp Ser Gin Gly Gly Asp Leu 

*g 530 535 540 

ffie Gin Leu Leu Met Pro 
545 550 

f2) INFORMATION FOR SEQ ID NO: 3: 

fU (i) SEQUENCE CHARACTERISTICS: 

rU (A) LENGTH: 16 amino acids 

M3 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Pro Arg Glu Leu lie Ala Tyr Ser Asn Tyr Pro Arg Asn Asn lie Pro 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ACTAGTAGCA CCTGCTGGAC GCCGGGAGGG ACCCGCTGAT GCTGCTGCA 4 9 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

w Thr Ser Ala Leu Asn Val Leu Leu lie lie Val Asp Asp Leu Arg Pro 

JH 1 5 10 15 

=P Ser Leu Gly Asp Tyr Asp Asp Val Leu 
%0 20 25 

Bj INFORMATION FOR SEQ ID NO: 6: 

= (i) SEQUENCE CHARACTERISTICS: 

fU (A) LENGTH: 442 8 base pairs 

U (B) TYPE: nucleic acid 

ry ( c ) STRANDEDNESS: double 

[ n (D) TOPOLOGY: linear 

% = (ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 332.. 434 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 536., 537 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 693. .829 
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(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 962.. 963 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1044.. 1221 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 1350. .1351 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1480.. 1569 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 1716. .1717 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1841.. 2041 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 2206.. 2207 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2 2 94.. 24 6 4 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 2585.. 2586 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2684 . .2810 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 2904.. 2905 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3033. .3206 

( ix ) FEATURE : 

(A) NAME /KEY : intron 

(B) LOCATION: 3308.. 3309 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 3435. .3908 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
GCGATCTAGA CCTAGTTAGC CAAGTCTCTA AC GTGAC ATA GGGAAAGCTT GCAATGGCAA 
CTGGCCGCCC GTCTGCGCCT GTCTCTCGCC ACGCCTATTG CTGCAGGATG ACGCGCACCT 
CTATGAACCC GCCGTGAGGT GTGAGTGTGA CGCAGGGAAG AGTCGC AC GG ACGCACTCGC 
^CTGCGGCCA GCTGCGGGCC CGGGCGGCGG CTGTGTTGCG CAGTCTTCAT GGGTTCCCGA 
^GAGGAGGTC TCTGTGGCTG CGGCGGCTGC TAACTGCGCC ACCTGCTGCA GCCTGTCCCC 
QCCGCTCTGA AGCGGCCGCG TCGAAGCCGA AATGCCGCCA CCCCGGACCG GCCGAGGCCT 
iJCTCTGGCTG GGTCTGGTTC TGAGCTCCGT CTGCGTCGCC CTCGGATCCG AAACGCAGGC 
jgAACTCGACC ACAGGTGCCG CCCACGCCCT CCCTGCCATC TCTTCTCCCT TCCTCCCTCC 
gTTCCTTCCT CCTTCCTTCT TTCCTTCCTT CTTTGTTTAT ATCCATTCTT TTTACCCCCC 
^CTCCCACCC TTGC TGAGGC ACAGCGCCCT CCCTGGCTAG GCTGTTAGGT GCAGGGTCCA 
xicCTTGGGCC TCTTAGTAAC CTAGCACCTA CCATGAGGGA GGGTTCAGTG TCAGTGCAGG 
TTACCTCACC AAAGCCCCTC CCTCCTGTGT AGATGCTCTG AAC GTTCTTC TCATCATCGT 
GGATGACCTG CGCCCCTCCC TGGGCTGTTA TGGGGATAAG CTGGTGAGGT CCCCAAATAT 
TGACCAACTG GCATCCCACA GCCTCCTCTT CCAGAATGCC TTTGCGCAGG TATGTC TGGG 
AACCTCTAGC TGTGGGTGTG TGCTGCTTCG TGCACTGAGG GTTGGGGGCG GGGAGCTTCA 
GCTATTGTCA GATGGCACAG ATTGTGC GGG ACATCTTGTT AGAGGGAAGC ATAGTCTGGA 
AAAGGGCGGT TGCTTGGTTA CCTAAGAGAT GGCAGACATG TTTTGCTGTG GCGATGCTTA 
CCTCTGCTTC TGCTCCCTAA CAGCAAGCAG TGTGCGCCCC GAGCCGCGTT TCTTTCCTCA 
CTGGCAGGAG ACCTGACACC ACCCGCCTGT ACGACTTCAA CTCCTACTGG AGGGTGCACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 





CTGGAAACTT 


CTCCACCATC 


CCCCAGTACT 


TCAAGGAGAA 


TGGCTATGTG 


AC C ATGTC GG 


1200 


TGGGAAAAGT 


CTTTCACGCT 


GGTACTGCTC 


CATGTCCAGA 


GTCTGGGTTC 


TCTTGGTTTG 


1260 


TGGTGTCTGA 


NTCCAGCATT 


CCCATCCTGG 


GGATGGGCTG 


TCTTTGCAGA 


GCCCTCTTCT 


1320 


GGCTGGGCGA 


GTCCCTCGCT 


AGTCAGTGCT 


TTTGTAGATG 


AGGA A AC TGA 


GCCCCAAAGA 


1380 


AGGGAGGNTC 


CACTTGCCCA 


TTTGTTTACA 


GAGTTTTAAT 


TATGGGGAGT 


GGGGTGTTGA 


1440 


AAGACTCATC 


ATGTTTTAAC 


AACCTTTTTT 


TTTTTCCAAG 


GGATATCTTC 


TAACCATACC 


1500 


GATGATTCTC 


CGTATAGCTG 


GTCTTTTCCA 


CCTTATCATC 


CTTCCTCTGA 


GAAGTATGAA 


1560 


AACACTAAGG 


TAAGGCTGTG 


AAAGGGACAT 


TTCTGAAGAG 


GAACC ACTTT 


TTCCTTTGTC 


1620 


ACATAAACTA 


C TGGGTAT AC 


TGCATGTNCT 


GTGAAGCTGG 


TTATATACCA 


CGAAGTTGTG 


1680 


GGTTTCATTT 


GTGATAATGT 


TTTGACAGAA 


GTAAGTTGTT 


CAGTCTGAGT 


GACTAACACG 


1740 


TGAAGGGCTG 


ATTATGTGAA 


CATTAAATCT 


GTGTGTGTAG 


CCTTCATGGC 


TTCATNTCTT 


1800 


XgCACTTAAAA 


AGCTGATGTT 


ATATTATTTT 


GTTTTGAAAG 


ACATGTCGAG 


GGCCAGATGG 


1860 


&GAACTCCAT 


GCCAACCTGC 


TTTGCCCTGT 


GGATGTGCTG 


GATGTTCCCG 


AGGGCACCTT 


1920 


JjCCTGACAAA 


CAGAGCACTG 


AGCAAGCCAT 


ACAGTTGTTG 


GAAAAGATGA 


AAACGTCAGC 


1980 


JGAGTCCTTTC 


TTCCTGGCCG 


TTGGGTATCA 


T A AGC C AC AC 


ATCCCCTTCA 


GAT AC C C C A A 


2040 


hGGTGAAGAGC 


TGGTTGAGGG 


CTGATCCAGC 


ACAGCTGTGA 


CAGCTGTGTT 


GTTTGTTGAG 


2100 


fSGAGGGATTT 


GCACAGGGAA 


GGTGGCTACA 


TCCTGCCATC 


GCCAGGCACC 


ATGGTTGCCT 


2160 


^QATGGGCACT 


AGTGTCCTCA 


GTGGAGTAAA 


GATGGGATTT 


AGAGGTAAAA 


GGCAGTATAG 


2220 


•ACAGTGATAG 


AGCCACAAGC 


TTGTGCTTTT 


GCTAAAAGAG 


TGACAACTTT 


GTGGCTTTGT 


2280 


G 1TTTTCCCC 


AAGGAATTTC 


AGAAGTTGTA 


TCCCTTGGAG 


AACATCACCC 


TGGCCCCCGA 


2340 


TC C C G AGGTC 


CCTGATGGCC 


TACCCCCTGT 


GGCCTACAAC 


CCCTGGATGG 


ACATCAGGCA 


2400 


ACGGGAAGAC 


GTCCAAGCCT 


TAAACATCAG 


TGTGCCGTAT 


GGTCCAATTC 


CTGTGGACTT 


2460 


TCAGGTATCA 


AGGACATAGT 


TTGGGGATGT 


ATTGGAC AC T 


GATGACATAG 


TGTCGTAGGT 


2520 


GAAACCACTC 


TTCTCAGTAG 


ACACAACTCC 


ACCTATAATG 


TCTTATTAAG 


AGCTTTCTTT 


2580 


GTGTGTAGGG 


ATTGGGAGAG 


ATGCACACGG 


CAAGCATTAT 


CTCTGTATGC 


CTTGGCAATT 


2640 


TAAATTGCAG 


TCACTCTCAT 


TTTTATTTTT 


TTTCAATTTG 


C AGC GG A AAA 


TCCGCCAGAG 


2700 


CTACTTTGCC 


TCTGTGTCAT 


ATTTGGATAC 


ACAGGTCGGC 


CGCCTCTTGA 


GTGCTTTGGA 


2760 





CGATCTTCAG 


CTGGPPAAPA 

v— X VJVJ v-> v->/A/Av^/A 


r*C APp a m/^ a m 


TGCATTTACC 


TCGGATCATG 


GTAAGCATTT 


2820 


TGAAATTCCC 


Tf^nTfi AHTP A 

X VJVJ JL vjT/AVJ J. C A 


a a a r* a Tiprnp a 
A A AC A 1 C I OA 


ACTTTCCTGT 


GAAACATGCT 


TTGCAAAATT 


2880 


GCC ATTGAP A 

wv-»v-«ir»x x vjrw^ /A 


X iHAAC A 1 VjVjO 


m /™* m /™i m m m /™i mm 

I GTGTTTCTT 


CTAGGTGATG 


AGTTTCTACT 


TCCTCTGGTT 


2940 


x x x xAv-./A/AV_./A vJ 


z^ 1 AAA m/^ a a it m 
O AAA I OA AAT 


GGTATCTAAA 


ATAAACAAGC 


TGTGGTATGA 


TGATTATTCA 


3000 


TTTTPTC!TP A 

x X X J. Vj> X VJ J. v^/A 


r p r p f T 1 f' T C* rprri 
X iClbl OC i X 


TTTATGAACT 


AGGGTGGGCT 


CTAGGTGAAC 


ATGGAGAATG 


3060 


GGC C A AAT AC 

VJ v^ V-* * li JL4 X J. V«* 


/AVI V-./A/A 111 lb 


A 1 <j 1 1 OCT AC 


CCATGTTCCC 


CTGATATTCT 


ATGTTCCTGG 


3120 


AAGGACGGPT 


x ^nL i J. v^ vjvj 


AOOCAGGCGA 


GAAGCTTTTC 


CCTTACCTCG 


ACCCTTTTGA 


3180 


TTCCGPPTPA 


V-./AVJ X X VjA X Vjvj 


AOCCAGGTAT 


A A A AT ATGC T 


GAAATGATAT 


TGCTTGACAG 


3240 


TAAGATPAPP 


XXX Avj X X X A X 


A I GTGAACCA 


CTTTATTGAA 


TCATAGGCTf 


TGGGGTTACA 


3300 


*x VJin\_. v^ v^ /Art 


A vj A X AAA X vJvJ 


m/~tm n n it trim i% ■» 

TGTAAATTAA 


AAAAAGAAAA 


CATATGGAGC 


CCAGACAGGG 


3360 


TCCTTTAPTd 


v- X CC X VjLL X vJ 


OCCATOGCAG 


GCTTTTATAA 


TGTAACCCAT 


TCTGCTCTGT 


3420 


^dGCTTCCTGT 


ttp a n nr a cxn. 

x x v_» in vjvj V-. /A VjvJ 


LftA X v- C A X OO 


ACCTTGTGGA 


ACTTGTGTCT 


CTTTTTCCCA 


3480 


^Sgctggctgg 


ACTTnrAnnA 

**v-> x x vjVw xAVj vJ/A 


ptpp a ppttpp 

L X bLAbb I1L 


CACCTCGCTG 


CCCCGTTCCT 


TCATTTCACG 


3540 


% 5tgagctgtg 


P AfJAPA Af-lPr 1 

v^/AvJjt^vJ/A/AvJvjC 


A AO A AC C TIC 


TGAAGCATTT 


TCGATTCCGT 


GACTTGGAAG 


3600 


ytGGATCPCSTA 

_ «vjun jl v*> oVJ x /A 


v-C X CCC X vjO X 


AA I CCCCGTG 


AACTGATTGC 


CTATAGCCAG 


TATCCCCGGC 


3660 


=P TTf A CI A P A m 
x J- v^rAvjTfAv^.r\ X 


COO 1 CAOIOO 


AATTCTGACA 


AGCCGAGTTT 


AAAAGATATA 


AAGATCATGG 


3720 


H3CTATTCPAT 


*aL^vjv-,/aCCA X A 


OAC T ATAGGT 


ATACTGTGTG 


GGTTGGCTTC 


AATCCTGATG 


3780 


rAATTTCTAGP 


x /A/Av-» 1 1 1 1^1 


/*** a a rnp a tt\i~* 
OAO A 1 CCATO 


CAGGGGAACT 


GTATTTTGTG 


GATTCTGACC 


3840 


"Sattgcagga 


TPAfAATATn 

x v»riv-r\rl x /A x vj 


rp a rri a a rrip a mm 
X /\ X A A X vj A X X 


CCCAAGGTGG 


AGATCTTTTC 


CAGTTGTTGA 


3900 


TGCCTTfiAfiT 

x VJ v.* V-< X X Vjl»Vj JL 


X X X vjCCAACC 


ATGGATGGCA 


AATGTGATGT 


GCTCCCTTCC 


AGC TGGTG AG 


3960 


AGGAfiGAflTT 

«vjvjjtivJvJZAvJ j. x 


AP A PPfTPPfTP 
AvjAvjL- X OO 1 C 


m m m m m is m 

OTTTTGTGAT 


T AC C CAT A AT 


ATTGGAAGCA 


GCCTGAGGGC 


4020 


TAGTTAATPP 


A A AP A rnpp Am 
rA/\/AC A X vjC A X 


a A z™ 1 a a mmm/*** 
OA AC A ATI TG 


GCCTGAGAAT 


ATGTAACAGC 


CAAACCTTTT 


4080 


CGTTTAGTCT 


TTATTAAAAT 1 

x x /A J. J. /A/AiA/A X 


mm A m t\ A mm/~»/~« 
X XAXAAX I OO 


TAATTGGACC 


AGTTTTTTTT 


TTAATTTCCC 


4140 


TCTTTTTAAA 


ACAGTTACGG 


CTTATTTACT 


GAATAAATAC 


AAAGCAAACA 


A AC TC A AGTT 


4200 


ATGTCATACC 


TTTGGATACG 


AAGACCATAC 


ATAATAACCA 


AACATAACAT 


TATACACAAA 


4260 


GAATACTTTC 


ATTATTTGTG 


GAATTTAGTG 


CATTTCAAAA 


AGTAATCATA 


TATCAAACTA 


4320 
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GGCACCACAC TAAGTTCCTG ATTATTTTGT TTATAATTTA ATAATATATC TTATGAGCCC 4 380 

TATATATTCA AAATATTATG TTAACATGTA ATCCATGTTT CTTTTTCC 4428 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Pro Arg Glu Leu lie Ala Tyr Ser Xaa Tyr Pro Arg Xaa Xaa lie Pro 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

V Z (A) LENGTH: 5 amino acids 

= t ( B ) TYPE : amino acid 

y (C) STRANDEDNESS: single 

y ( D) TOPOLOGY: linear 

W (ii) MOLECULE TYPE: peptide 

p (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

u Cys Thr Pro Ser Arg 

ru 1 5 

J2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Gly Lys Trp His Leu Gly 
1 5 




81 
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(2) INFORMATION FOR SEoTlD NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGCCTCTAGA CCAGCTACAG TCGGAAACCA TCAGCAAGCA GGTCATTGTT CCAACATGCC 60 
GCCACCCCGG ACCGGCCGAG G 
(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 amino acids 

r== (B) TYPE: amino acid 

;S (C) STRANDEDNESS: single 

Pf| (D) TOPOLOGY: linear 

% (ii) MOLECULE TYPE: protein 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

- Thr Arg Arg Pro Asn Val Val Leu Leu Leu Thr Asp Asp Gin Asp Glu 
h 5 10 15 

ru Val Leu Gly Gly Met Thr Pro Leu Lys Lys Thr Lys Ala Leu He Gly 

20 25 30 

q Glu Met Gly Met Thr Phe Ser Ser Ala Tyr Val Pro Ser Ala Leu Cys 
=y 35 40 45 

Cys Pro Ser Arg Ala Ser He Leu Thr Gly Lys Tyr Pro His Asn His 
50 55 60 

His Val Val Asn Asn Thr Leu Glu Gly Asn Cys Ser Ser Lys Ser Trp 
65 70 75 80 

Gin Lys He Gin Glu Pro Asn Thr Phe Pro Ala He Leu Arg Ser Met 

85 90 95 

Gin Gly Tyr Gin Thr Phe Thr Phe Phe Ala Gly Lys Tyr Leu Asn Glu 
100 105 110 

Tyr Gly Ala Pro Asp Ala Gly Gly Leu Glu His Val Pro Leu Gly Trp 
115 120 125 
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Ser Tyr Trp Tyr Ala Leu Glu Lys Asn Ser Lys Tyr Tyr Asn Tyr Thr 
130 135 140 

Leu Ser lie Asn Gly Lys Ala Arg Lys His Gly Glu Asn Tyr Ser Val 
145 150 155 160 

Asp Tyr Leu Thr Asp Val Leu Ala Asn Val Ser Leu Asp Phe Leu Asp 

165 170 175 

Tyr Lys Ser Asn Glu Glu Pro Phe Phe Met Met lie Ala Thr Pro Ala 
180 185 190 

Pro His Ser Pro Trp Thr Ala Ala Pro Gin Tyr Gin Lys Ala Phe Gin 
195 200 205 

Asn Val Phe Ala Pro Arg Asn Lys Asn Phe Asn lie His Gly Thr Asn 
210 215 220 

Lys His Trp Leu lie Arg Gin Ala Lys Thr Pro Met Thr Asn Ser Ser 
225 230 235 240 

lie Gin Phe Leu Asp Asn Ala Phe Arg Lys Arg Trp Gin Thr Leu Leu 

245 250 255 

Ser Val Asp Asp Leu Val Glu Lys Leu Val Lys Arg Leu Glu Phe Thr 
260 265 270 

Gly Glu Leu Asn Asn Thr Tyr lie Phe Tyr Thr Ser Asp Asn Gly Tyr 
275 280 285 

His Thr Gly Gin Phe Ser Leu Pro lie Asp Lys Arg Gin Leu Tyr Glu 
290 295 300 

Phe Asp lie Lys Val Pro Leu Leu Val Arg Gly Pro Gly lie Lys Pro 
305 310 315 320 

Asn Gin Thr Ser Lys Met Leu Val Ala Asn lie Asp Leu Gly Pro lie 

325 330 335 

Leu Asp lie Ala Gly Tyr Asp Leu Asn Lys Thr Gin Met Asp Gly Met 
340 345 350 

Ser Leu Leu Pro: lie Leu Arg Gly Ala Ser Asn Leu Thr Trp Arg Ser 
355 360 365 

Asp Val Leu Val Glu Tyr Gin Gly Glu Gly Arg Asn Val Thr Asp Pro 
370 375 380 

Thr Cys Pro Ser Leu Ser Pro Gly Val Ser Gin Cys Phe Pro Asp Cys 
385 390 395 400 

Val Cys Glu Asp Ala Tyr Asn Asn Thr Tyr Ala Cys Val Arg Thr Met 

405 410 415 
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Ser Ala Leu Trp Asn Leu Gin Tyr Cys Glu Phe Asp Asp Gin Glu Val 
420 425 430 

Phe Val Glu Val Tyr Asn Leu Thr Ala Asp Pro Asp Gin He Thr Asn 
435 440 445 

He Ala Lys Thr He Asp Pro Glu Leu Leu Gly Lys Met Asn Tyr Arg 
450 455 460 

Leu Met Met Leu Gin Ser Cys Ser Gly Pro Thr Cys Arg Thr Pro Gly 
465 470 475 480 

Val Phe Asp Pro Gly Tyr Arg Phe Asp Pro Arg Leu Met Phe Ser Asn 

485 490 495 

Arg Gly Ser Val Arg Thr Arg Arg Phe Ser Lys His Leu Leu 
500 505 510 

2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 amino acids. 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Gly Ala Pro Arg Ser Leu Leu Leu Ala Leu Ala Ala Gly Leu Ala 
1 5 10 15 

Val Ala Arg Pro Pro Asn He Val Leu He Phe Ala Asp Asp Leu Gly 
20 25 30 

Tyr Gly Asp Leu Gly Cys Tyr Gly His Pro Ser Ser Thr Thr Pro Asn 
35 40 45 

Leu Asp Gin Leu Ala Ala Gly Gly Leu Arg Phe Thr Asp Phe Tyr Val 
50 55 60 

Pro Val Ser Leu Gin Thr Pro Ser Arg Ala Ala Leu Leu Thr Gin Arg 
65 70 75 80 

Leu Pro Val Arg Met Gly Met Tyr Pro Gly Val Leu Val Pro Ser Ser 

85 90 95 

Arg Gly Gly Leu Pro Leu Glu Glu Val Thr Val Ala Glu Val Leu Ala 
100 105 110 

Ala Arg Gly Tyr Leu Thr Gly Met Ala Gly Lys Trp His Leu Gly Val 
115 ' " 120 125 
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Gly Pro Glu Gly Ala Phe Leu Pro Prp His Gin Gly Phe His Arq Phe 
130 135 140 

Leu Gly He Pro Tyr Ser His Asp Gin Gly Pro CyS Gin Asn Leu Thr 
145 150 155 160 

Cys Phe Pro Pro Ala Thr Pro Cys Asp Gly Gly Cys Asp Gin Gly Leu 

165 170 175 

Val Pro He Pro Leu Leu Ala Asn Leu Ser Val Glu Ala Gin Pro Pro 
180 185 190 

Trp Leu Pro Gly Leu Glu Ala Arg Tyr Met Ala Phe Ala His Asp Leu 
195 200 205 

Met Ala Asp Ala Gin Arg Gin Asp Arg Pro Phe Phe Leu Tyr Tyr Ala 
210 215 220 

Ser His His Thr His Tyr Pro Gin Phe Ser Gly Gin Ser Phe Ala Glu 
225 230 235 240 

Arg Ser Gly Arg Gly Pro Phe Gly Asp Ser Leu Met Glu Leu Asp Ala 

245 250 255 

Ala Val Gly Thr Leu Met Thr Ala He Gly Asp Leu Gly Leu Leu Glu 
260 265 270 

Glu Thr Leu Val He Phe Thr Ala Asp Asn Gly Pro Glu Thr Met Arg 
275 280 285 

Met Ser Arg Gly Gly Cys Ser Gly Leu Leu Arg Cys Gly Lys Gly Thr 
290 295 300 

Thr Tyr Glu Gly Gly Val Arg Glu Pro Ala Leu Ala Phe Trp Pro Gly 
305 310 315 320 

His He Ala Pro Gly Val Thr His Glu Leu Ala Ser Ser Leu Asp Leu 

325 330 335 

Leu Pro Thr Leu Ala Ala Leu Ala Gly Ala Pro Leu Pro Asn Val Thr 
340 345 350 

Leu Asp Gly Phe Asp Leu Arg Pro Pro Ala Ala Gly His Arg Gin Glu 
355 360 365 

Pro Ser Ala Val Ser Leu Leu Leu Pro Val Leu Pro Arg Arg Gly Pro 
370 375 380 

Trp Gly Phe Cys Cys Ala Asp Trp Lys Val Gin Gly Ser Leu Leu His 
385 390 395 400 

Pro Gly Ser Ala His Ser Asp Thr Thr Ala Asp Pro Ala Cys His Ala 

405 410 415 
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Ser Ser Ser Leu Thr Ala His Glu Pro Pro Leu Leu Tyr Asp Leu Ser 
420 425 430 

Lys Asp Pro Gly Glu Asn Tyr Asn Leu Leu Gly Gly Val Ala Gly Ala 
435 440 445 

Thr Pro Glu Val Leu Gin Ala Leu Lys Gin Leu Gin Leu Leu Lys Ala 
450 455 460 

Gin Leu Asp Ala Ala Val Thr Phe Gly Pro Ser Gin Val Ala Arq Gly 
465 470 475 480 

Glu Asp Pro Ala Leu Gin lie Cys Cys His Pro Gly Cys Thr Pro Arg 

485 490 495 

Pro Ala Cys Cys His Cys Pro Asp Pro His Ala 
500 505 

2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear : . # 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Gly Pro Arg Gly Ala Ala Ser Leu Pro Arg Gly Pro Gly Pro Arg 
1 5 10 15 

Arg Leu Leu Leu Pro Val Val Leu Pro Leu Leu Leu Leu Leu Leu Leu 
20 25 30 

Ala Pro Pro Gly Ser Gly Ala Gly Ala Ser Arg Pro Pro His Leu Val 
35 40 45 

Phe Leu Leu Ala Asp Asp Leu Gly Trp Asn Asp Val Gly Phe His Gly 
50 55 60 

Ser Arg He Arg Thr Pro His Leu Asp Ala Leu Ala Ala Gly Gly Val 
65 70 75 80 

Leu Leu Asp Asn Tyr Tyr Thr Gin Pro Leu Cys Thr Pro Ser Arg Ser 

85 90 95 

Glh Leu Leu Thr Gin Arg Tyr Gin He Arg Thr Gly Leu Gin His Gin 
100 105 110 

He He Trp Pro Cys Gin Pro Ser Cys Val Pro Leu Asp Glu Lys Leu 
115 120 125 
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Leu Pro Gin Leu Leu Lys Glu Ala Gly Tyr Thr Thr His Met Val Gly 
130 135 140 

Lys Trp His Leu Gly Met Tyr Arg Lys Glu Cys Leu Pro Thr Arg Arg 
145 150 155 160 

Gly Phe Asp Thr Tyr Phe Gly Tyr Leu Leu Gly Ser Glu Asp Tyr Tyr 

165 170 175 

Ser His Glu Arg Cys Thr Leu lie Asp Ala Leu Asn Val Thr Arg Cys 
180 185 190 

Ala Leu Asp Phe Arg Asp Gly Glu Glu Val Ala Thr Gly Tyr Lys Asn 
195 200 205 

Met Tyr Ser Thr Asn He Phe Thr Lys Arg Ala He Ala Leu He Thr 
210 215 220 

Asn His Pro Pro Glu Lys Pro Leu Phe Leu Tyr Leu Ala Leu Gin Ser 
225 230 235 240 

Val His Glu Pro Leu Gin Val Pro Glu Glu Tyr Leu Lys Pro Tyr Asp 

245 250 255 

Phe He Gin Asp Lys Asn Arg His His Tyr Ala Gly Met Val Ser Leu 
260 265 270 

Met Asp Glu Ala Val Gly Asn Val Thr Ala Ala Leu Lys Ser Ser Gly 
275 280 285 

Leu Trp Asn Asn He Val Phe He Phe Ser Thr Asp Asn Gly Gly Gin 
290 295 300 

Thr Leu Ala Gly Gly Asn Asn Trp Pro Leu Arg Gly Arg Lys Trp Ser 
305 310 315 320 

Leu Trp Glu Gly Gly Val Arg Gly Val Gly Phe Val Ala Ser Pro Leu 

325 330 335 

Leu Lys Gin Lys Gly Val Lys Asn Arg Glu Leu He His lie Ser Asp 
340 345 350 

Trp Leu Pro Thr Leu Val Lys Leu Ala Arg Gly His Thr Asn Gly Thr 
355 360 365 

Lys Pro Leu Asp Gly Phe Asp Val Trp Lys Thr He Ser Glu Gly Ser 
370 375 380 

Pro Ser Pro Arg lie Glu Leu Leu His Asn He Asp Pro Asn Phe Val 
385 390 395 400 

Asp Ser Ser Pro Cys Pro Arg Asn Ser Met Ala Pro Ala Lys Asp Asp 

405 410 415 




62 



Ser ser Leu Pro Glu Tyr Ser Ala Phe Asn Thr Ser Val His Ala Ala 
420 425 430 

He Arg His Gly Asn Trp Lys Leu Leu Thr Gly Tyr Pro Gly Cys Gly 
435 440 445 

Tyr Trp Phe Pro Pro Pro Ser Gin Tyr Asn Val Ser Glu He Pro Ser 
450 455 460 

Ser Asp Pro Pro Thr Lys Thr Leu Trp Leu Phe Asp .lie Asp Arg Asp 
465 470 475 480 

Pro Glu Glu Arg His Asp Leu Ser Arg Glu Tyr Pro His He Val Thr 

485 490 495 

Lys Leu Leu Ser Arg Leu Gin Phe Tyr His Lys His Ser Val Pro Val 
500 505 510 

Tyr Phe Pro Ala Gin Asp Pro Arg Cys Asp Pro Lys Ala Thr Gly Val 
515 520 525 

Trp Gly Pro Trp Met 
530 

2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 583 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Pro Leu Arg Lys Met Lys He Pro Phe Leu Leu Leu Phe Phe Leu 
1 5 10 15 

Trp Glu Ala Glu Ser His Ala Ala Ser Arg Pro Asn He He Leu Val 
20 25 30 

Met Ala Asp Asp Leu Gly He Gly Asp Pro Gly Cys Tyr Gly Asn Lys 
35 4 0 45 

Thr lie Arg Thr Pro Asn He Asp Arg Leu Ala Ser Gly Gly Val Lys 
50 55 60 

Leu Thr Gin His Leu Ala Ala Ser Pro Leu Cys lie Pro Ser Arq Ala 
65 70 75 80 

Ala Phe Met Thr Gly Arg Tyr Pro Val Arg Ser Gly Met Ala Ser Trp 

85 90 95 
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Ser Arg Thr Gly Val Phe Leu Phe Thr Ala Ser Ser Gly Gly Leu Pro 
100 10.5 110 

Thr Asp Glu He Thr Phe Ala Lys Leu Leu Lys Asp Gin Gly Tyr Ser 
115 120 125 

Thr Ala Leu He Gly. Lys Trp His Leu Gly Met Ser Cys His Ser Lys 
130 135 140 

Thr Asp Phe Cys His His Pro Leu His His Gly Phe Asn Tyr Phe Tyr 
145 150 155 160 

Gly He Ser Leu Thr Asn Leu Arg Asp Cys Lys Pro Gly Glu Gly Ser 

165 170 175 

Val Phe Thr Thr Gly Phe Lys Arg Leu Val Phe Leu Pro Leu Gin He 
180 185 190 

Val Gly Val Thr Leu Leu Thr Leu Ala Ala Leu Asri Cys Leu Gly Leu 
195 200 205- 

Leu His Val Pro Leu Gly Val Phe Phe Ser Leu Leu Phe Leu Ala Ala 
210 215 220 

Leu He Leu Thr Leu Phe Leu Gly Phe Leu His Tyr Phe Arg Pro Leu 
225 230 235 240 

Asn Cys Phe Met Met Arg Asn Tyr Glu lie He Gin Gin Pro Met Ser 

245 250 255 

Tyr Asp Asn Leu Thr Gin Arg Leu Thr Val Glu Ala Ala Gin Phe He 
260 265 270 

Gin Arg Asn Thr Glu Thr Pro Phe Leu Leu Val Leu Ser Tyr Leu His 
275 280 285 

Val His Thr Ala Leu Phe Ser Ser Lys Asp Phe Ala Gly Lys Ser Gin 
290 295 300 

His Gly Val Tyr Gly Asp Ala Val Glu Glu Met Asp Trp Ser Val Gly 
305 310 315 320 

Gin He Leu Asn Leu Leu Asp Glu Leu Arg Leu Ala Asn Asp He Leu 

325 330 335 

lie Tyr Phe Thr Ser Asp Gin Gly Ala His Val Glu Glu Val Ser Ser 
340 345 350 

Lys Gly Glu He His Gly Gly Ser Asn Gly He Tyr Lys Gly Gly Lys 
355 360 365 

Ala Asn Asn Trp Glu Gly Gly He Arg Val Pro Gly He Leu Arg Trp 
370 375 380 
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Pro Arg Val lie Gin Ala Gly Gin Lys He Asp Glu Pro Thr Ser Asn 
385 390 395 400 

Met Asp He Phe Pro Thr Val Ala Lys Leu Ala Gly Ala Pro Leu Pro 

405 410 415 

Glu Asp Arg He lie Asp Gly Arg Asp Leu Met Pro Leu Leu Glu Gly 
42n 425 430 

Lys Ser Gin Arg Ser Asp His Glu Phe Leu Phe His Tyr Cys Asn Ala 
435 440 445 

Tyr Leu Asn Ala Val Arg Trp His Pro Gin Asn Ser Thr Ser He Trp 
450 455 460 

Lys Ala Phe Phe Phe Thr Pro Asn Phe Asn Pro Val Gly Ser Asn Glv 
465 470 475 480 

Cys Phe Ala Thr His Val Cys Phe Cys Phe Gly Ser Tyr Val Thr His 

485 490 495 

His Asp Pro Pro Leu Leu Phe Asp He Ser Lys Asp Pro Arg Glu Arg 
500 505 510 

Asn Pro Leu Thr Pro Ala Ser Glu Pro Arg Phe Tyr Glu He Leu Lys 
515 520 525 

Val Met Gin Glu Ala Ala Asp Arg His Thr Gin Thr Leu Pro Glu Val 
530 535 540 

Pro Asp Gin Phe Ser Trp Asn Asn Phe Leu Trp Lys Pro Trp Leu Gin 
54 5 550 555 560 

Leu Cys Cys Pro Ser Thr Gly Leu Ser Cys Gin Cys Asp Arg Glu Lys 

565 570 575 

Gin Asp Lys Arg Leu Ser Arg 
580 

2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Lys Ser Ala Pro Phe Leu Phe Leu Leu Gly Leu Leu Gly Leu Val 
1 5 10 15 
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Thr Ala Gin Thr Gin Asp Pro Ala Leu Leu Asp Leu Leu Arg Glu Asn 
20 25 30 

Pro Asp Leu Leu Ser Leu Leu Leu Gin Ser Asn Glu His Arg Ala Pro 
35 40 45 

Leu Val Lys Pro Asn Val Val Leu Leu Val Ala Asp Asp Met Gly Ser 
50 55 60 

Gly Asp Leu Thr Ser Tyr Gly His Pro Thr Gin Glu Ala Gly Phe He 
65 70 75 80 

Asp Lys Met Ala Ala Glu Gly Leu Arg Phe Thr Asn Gly Tyr Val Gly 

85 90 95 

Asp Ala Val Cys Thr Pro Ser Arg Ser Ala lie Met He Gly Arg Leu 
100 105 110 

Pro Val Arg He Gly Thr Phe Gly Glu Thr Arg Val Phe Leu Pro Trp 
115 120 125 

Thr Lys Thr Gly Leu Pro Lys Ser Glu Leu Thr He Ala Glu Ala Met 
130 135 140 

Lys Glu Ala Gly Tyr Ala He Gly Met Val Gly Lys Trp His Leu Gly 
I 45 150 155 160 

Met Asn Glu Asn Ser Ser He Asp Gly Ala His Leu Pro Phe Asn His 

165 170 175 

Gly Phe Asp Phe Val Gly His Asn Leu Pro Phe Thr Asn Ser Trp Ser 
180 185 190 

Cys Asp Asp Thr Gly Leu His Lys Asp Phe Pro Asp Ser Gin Arg Cys 
195 200 205 

Tyr Leu Tyr Val Asn Ala Thr Leu Val Ser Gin Pro Tyr Gin His Lys 
210 215 220 

Gly Leu Thr Gin Leu Phe Thr Asp Asp Ala Leu Gly Phe He Glu Asp 
225 230 235 240 

Asn His Ala Asp Pro Phe Phe Leu Tyr Val Ala Phe Ala His Met His 

245 250 255 

Thr Ser Leu Phe Ser Ser Asp Asp Phe Ser Cys Thr Ser Arg Arg Gly 
26p 265 270 

Arg Tyr Gly Asp Asn Leu Leu Glu Met His Asp Ala Val Asp Lys He 
275 280 285 

Val Asp Lys Leu Glu Glu Asn Asn He Ser Glu Asn He He He Phe 
290 295 300 
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Phe He Ser Asp His Gly Pro His Arg Glu Tyr Cys Glu Glu Glv Glv 
305 310 315 320 

Asp Ala Ser He Phe Arg Gly Gly Lys Ser His Ser Trp Glu Gly Gly 

325 330 335 

His Arg He Pro Tyr He Val Tyr Trp Pro Gly Thr He Ser Pro Gly 
340 345 350 

He Ser Asn Glu He Val Thr Ser Met Asp He He Ala lie Ala Ala 
355 360 365. 

Asp Leu Gly Gly Thr Thr Leu Pro Thr Asp Arg lie Tyr Asp Gly Lys 
370 375 380 

Ser He Lys Asp Val Leu Leu Glu Gly Ser Ala Ser Pro His Ser Ser 
385 390 395 400 

Phe Phe Tyr Tyr Cys Lys Asp Asn Leu Met Ala Val Arg Val Gly Lys 

405 410 v 415 

Tyr Lys Ala His Phe Arg Thr Gin Arg Val Arg Ser Gin Asp Glu Tyr 
420 425 430 

Gly Leu Glu Cys Ala Gly Gly Phe Pro Leu Giu Asp Tyr Phe Asp Cys 
435 440 445 

Asn Asp Cys Glu Gly Asp Cys Val Thr Glu His Asp Pro Pro Leu Leu 
450 455 460 

Phe Asp Leu Met Arg Asp Pro Gly Glu Ala Tyr Pro Leu Glu Ala Cys 
465 470 475 480 

Gly His Glu Asp Val Phe Leu Thr Val Lys Ser Thr Val Glu Glu His 

485 490 495 

Lys Ala Ala Leu Val Lys Cys Thr Pro Leu Leu Asp Ser Phe Asp His 
500 505 510 

Ser He Val Pro Cys Cys Asn Pro Ala Asn Cys Cys He Cys Asn Tyr 
515 520 525 

Val His Glu Pro Gly Met Pro Glu Cys Tyr Gin Asp Gin Val Ala Thr 
530 535 540 

Ala Ala Arg His Tyr Arg Pro 
545 550 




